WhatsApp data

Each folder contains 3 files, corresponding to one region.

In India, the data was collected in a village in Jharkand, India.
In Indonesia in Jakarta among college students.
In Colombia, primarily in Cali.

We are releasing only content that is marked as `forwarded many times' by WhatsApp (considered viral).

- messageStats.csv contains metadata on who sent (user_id) what message (message_id), when (timestamp), and where (group_id).
- messageContent.csv contains the content of the messages (if text) or link to the content (only if image or video).
- group_info.csv contains the information on the name of the group.
- content_sample contains a sample of the actual image or video files (due to upload limits by Harvard dataverse). Please contact kiran.garimella@rutgers.edu for the full content (only for academic use). The full dataset with all the content is 29G.

We are not releasing the demographics of the donators because of the private nature of WhatsApp.
